gh-129987: Selectively re-enable SLP autovectorization of _PyEval_EvalFrameDefault #132530
…ewer GCCs, as the optimization bug seems to exist only on GCC 12 and later, and before GCC 9 disabling the optimization has a dramatic performance impact.
LGTM - thanks for the thorough investigation!
For the record, I've rerun the benchmarks on GCCs 7-15 with this change to verify it fixes the performance drop on old GCCs and doesn't affect new GCCs.

(Buildbot error is noise; it's been failing like that for ages.)
Only disable SLP autovectorization of `_PyEval_EvalFrameDefault` on newer GCCs, as the optimization bug seems to exist only on GCC 12 and later, and before GCC 9 disabling the optimization has a dramatic performance impact.

DISPATCH calls #129987
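For readers unfamiliar with per-function optimization control, the following is a minimal sketch of how a version-gated opt-out of this kind can be written in C. It is not the code from this PR: the macro names `EXAMPLE_NO_SLP_VECTORIZE` and `EXAMPLE_SLP_DISABLED` and the function `example_dispatch_loop` are hypothetical, and the explicit `__clang__` check is an assumption added for clarity. The only GCC features relied on are the `__GNUC__` version macro and the `optimize` function attribute with `no-tree-slp-vectorize`.

```c
#include <stddef.h>

/*
 * Hypothetical macro names, chosen for this sketch only.
 * On GCC 12 and later, turn off SLP autovectorization for functions
 * tagged with the macro; on every other compiler or version the macro
 * expands to nothing, so older GCCs keep the optimization.
 * (Clang also defines __GNUC__, so it is excluded explicitly here;
 * that exclusion is an assumption of this sketch.)
 */
#if defined(__GNUC__) && !defined(__clang__) && (__GNUC__ >= 12)
#  define EXAMPLE_NO_SLP_VECTORIZE \
       __attribute__((optimize("no-tree-slp-vectorize")))
#  define EXAMPLE_SLP_DISABLED 1
#else
#  define EXAMPLE_NO_SLP_VECTORIZE
#  define EXAMPLE_SLP_DISABLED 0
#endif

/* A toy stand-in for an interpreter loop: one switch per opcode. */
EXAMPLE_NO_SLP_VECTORIZE
static long example_dispatch_loop(const unsigned char *ops, size_t n)
{
    long acc = 0;
    for (size_t i = 0; i < n; i++) {
        switch (ops[i]) {
        case 0:  acc += 1; break;   /* increment */
        case 1:  acc -= 1; break;   /* decrement */
        default: acc *= 2; break;   /* any other opcode doubles */
        }
    }
    return acc;
}
```

The attribute changes only how the tagged function is compiled, so the rest of the translation unit keeps its normal optimization flags. Whether this particular gate matches the condition used in the PR (for example, any architecture restriction) is not something the text above confirms.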